Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Zeitungsdigitalisierung: eine neue Herausforderung für die ULB Halle: Werkstattbericht aus der Pilotphase des DFG-Projekts ?Digitalisierung historischer Zeitungen"

Identifieur interne : 000140 ( Main/Exploration ); précédent : 000139; suivant : 000141

Zeitungsdigitalisierung: eine neue Herausforderung für die ULB Halle: Werkstattbericht aus der Pilotphase des DFG-Projekts ?Digitalisierung historischer Zeitungen"

Auteurs : Dorothea Sommer [Allemagne] ; Kay Heiligenhaus [Allemagne] ; Manfred Pankratz [Allemagne] ; Carola Wippermann [Allemagne]

Source :

RBID : Pascal:14-0279350

Descripteurs français

English descriptors

Abstract

Newspaper digitization and an appropriate presentation on the internet is still a challenge in Germany, where the production and distribution of newspapers have a long and distinctive tradition. The article presents the current findings of a newspaper digitization project that is being carried out at Halle University and State library. The project is part of a Pilot phase of newspaper digitization supported by the German Research Foundation, in which various German libraries are cooperating, each with its own task and technical approach. The project in Halle focuses on the digitization of the newspaper "Hallisches Tageblatt", which was founded in the end of the 18th century and ceased publication in 1892. The article describes methods and results of an OCR test for Gothic (Fraktur) printing types and structural indexing. It also deals with the investigation of opportunities for establishing routines of persistent addressing of periodicals and excerpts from them in order to enable and facilitate reliable and long-term valid digital citation practices for the academic community. This part of the project is being realized in collaboration with the German National Library. The procedures are based on Uniform Resource Names (URNs), which perform as Persistent Identifiers. The application of URNs for periodicals is an extended version of the harvesting routines of URN Granular, which have already been implemented successfully for monographs.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="GER" level="a">Zeitungsdigitalisierung: eine neue Herausforderung für die ULB Halle: Werkstattbericht aus der Pilotphase des DFG-Projekts ?Digitalisierung historischer Zeitungen"</title>
<author>
<name sortKey="Sommer, Dorothea" sort="Sommer, Dorothea" uniqKey="Sommer D" first="Dorothea" last="Sommer">Dorothea Sommer</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Amtierende Direktorin Universitäts- und Landesbibliothek Sachsen-Anhalt, August-Bebel-Strasse 13</s1>
<s2>06098 Halle (Saale)</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<wicri:noRegion>06098 Halle (Saale)</wicri:noRegion>
<wicri:noRegion>August-Bebel-Strasse 13</wicri:noRegion>
<wicri:noRegion>06098 Halle (Saale)</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Heiligenhaus, Kay" sort="Heiligenhaus, Kay" uniqKey="Heiligenhaus K" first="Kay" last="Heiligenhaus">Kay Heiligenhaus</name>
<affiliation wicri:level="3">
<inist:fA14 i1="02">
<s1>Geschäftsführer semantics Kommunikationsmanagement GmbH, Viktoriaallee 45</s1>
<s2>52066 Aachen</s2>
<s3>DEU</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName>
<region type="land" nuts="1">Rhénanie-du-Nord-Westphalie</region>
<region type="district" nuts="2">District de Cologne</region>
<settlement type="city">Aix-la-Chapelle</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Pankratz, Manfred" sort="Pankratz, Manfred" uniqKey="Pankratz M" first="Manfred" last="Pankratz">Manfred Pankratz</name>
<affiliation wicri:level="1">
<inist:fA14 i1="03">
<s1>Mikrofilmarchiv der deutschsprachigen Presse (MFA) e. V., Max-von-der Grün-Platz</s1>
<s2>44122 Dortmund</s2>
<s3>DEU</s3>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<wicri:noRegion>44122 Dortmund</wicri:noRegion>
<wicri:noRegion>Max-von-der Grün-Platz</wicri:noRegion>
<wicri:noRegion>44122 Dortmund</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Wippermann, Carola" sort="Wippermann, Carola" uniqKey="Wippermann C" first="Carola" last="Wippermann">Carola Wippermann</name>
<affiliation wicri:level="1">
<inist:fA14 i1="03">
<s1>Mikrofilmarchiv der deutschsprachigen Presse (MFA) e. V., Max-von-der Grün-Platz</s1>
<s2>44122 Dortmund</s2>
<s3>DEU</s3>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<wicri:noRegion>44122 Dortmund</wicri:noRegion>
<wicri:noRegion>Max-von-der Grün-Platz</wicri:noRegion>
<wicri:noRegion>44122 Dortmund</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">14-0279350</idno>
<date when="2014">2014</date>
<idno type="stanalyst">PASCAL 14-0279350 INIST</idno>
<idno type="RBID">Pascal:14-0279350</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000002</idno>
<idno type="stanalyst">FRANCIS 14-0279350 INIST</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000033</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000763</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000023</idno>
<idno type="wicri:doubleKey">0720-6763:2014:Sommer D:zeitungsdigitalisierung:eine:neue</idno>
<idno type="wicri:Area/Main/Merge">000141</idno>
<idno type="wicri:Area/Main/Curation">000140</idno>
<idno type="wicri:Area/Main/Exploration">000140</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="GER" level="a">Zeitungsdigitalisierung: eine neue Herausforderung für die ULB Halle: Werkstattbericht aus der Pilotphase des DFG-Projekts ?Digitalisierung historischer Zeitungen"</title>
<author>
<name sortKey="Sommer, Dorothea" sort="Sommer, Dorothea" uniqKey="Sommer D" first="Dorothea" last="Sommer">Dorothea Sommer</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Amtierende Direktorin Universitäts- und Landesbibliothek Sachsen-Anhalt, August-Bebel-Strasse 13</s1>
<s2>06098 Halle (Saale)</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<wicri:noRegion>06098 Halle (Saale)</wicri:noRegion>
<wicri:noRegion>August-Bebel-Strasse 13</wicri:noRegion>
<wicri:noRegion>06098 Halle (Saale)</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Heiligenhaus, Kay" sort="Heiligenhaus, Kay" uniqKey="Heiligenhaus K" first="Kay" last="Heiligenhaus">Kay Heiligenhaus</name>
<affiliation wicri:level="3">
<inist:fA14 i1="02">
<s1>Geschäftsführer semantics Kommunikationsmanagement GmbH, Viktoriaallee 45</s1>
<s2>52066 Aachen</s2>
<s3>DEU</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName>
<region type="land" nuts="1">Rhénanie-du-Nord-Westphalie</region>
<region type="district" nuts="2">District de Cologne</region>
<settlement type="city">Aix-la-Chapelle</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Pankratz, Manfred" sort="Pankratz, Manfred" uniqKey="Pankratz M" first="Manfred" last="Pankratz">Manfred Pankratz</name>
<affiliation wicri:level="1">
<inist:fA14 i1="03">
<s1>Mikrofilmarchiv der deutschsprachigen Presse (MFA) e. V., Max-von-der Grün-Platz</s1>
<s2>44122 Dortmund</s2>
<s3>DEU</s3>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<wicri:noRegion>44122 Dortmund</wicri:noRegion>
<wicri:noRegion>Max-von-der Grün-Platz</wicri:noRegion>
<wicri:noRegion>44122 Dortmund</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Wippermann, Carola" sort="Wippermann, Carola" uniqKey="Wippermann C" first="Carola" last="Wippermann">Carola Wippermann</name>
<affiliation wicri:level="1">
<inist:fA14 i1="03">
<s1>Mikrofilmarchiv der deutschsprachigen Presse (MFA) e. V., Max-von-der Grün-Platz</s1>
<s2>44122 Dortmund</s2>
<s3>DEU</s3>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<wicri:noRegion>44122 Dortmund</wicri:noRegion>
<wicri:noRegion>Max-von-der Grün-Platz</wicri:noRegion>
<wicri:noRegion>44122 Dortmund</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">ABI - Technik</title>
<title level="j" type="abbreviated">ABI - Tech.</title>
<idno type="ISSN">0720-6763</idno>
<imprint>
<date when="2014">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">ABI - Technik</title>
<title level="j" type="abbreviated">ABI - Tech.</title>
<idno type="ISSN">0720-6763</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Digitizing</term>
<term>Electronic document</term>
<term>Germany</term>
<term>Newspaper</term>
<term>Project</term>
<term>University library</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Allemagne</term>
<term>Bibliothèque universitaire</term>
<term>Journal</term>
<term>Projet</term>
<term>Numérisation</term>
<term>Document électronique</term>
</keywords>
<keywords scheme="Wicri" type="geographic" xml:lang="fr">
<term>Allemagne</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Bibliothèque universitaire</term>
<term>Journal</term>
<term>Numérisation</term>
<term>Document électronique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Newspaper digitization and an appropriate presentation on the internet is still a challenge in Germany, where the production and distribution of newspapers have a long and distinctive tradition. The article presents the current findings of a newspaper digitization project that is being carried out at Halle University and State library. The project is part of a Pilot phase of newspaper digitization supported by the German Research Foundation, in which various German libraries are cooperating, each with its own task and technical approach. The project in Halle focuses on the digitization of the newspaper "Hallisches Tageblatt", which was founded in the end of the 18
<sup>th</sup>
century and ceased publication in 1892. The article describes methods and results of an OCR test for Gothic (Fraktur) printing types and structural indexing. It also deals with the investigation of opportunities for establishing routines of persistent addressing of periodicals and excerpts from them in order to enable and facilitate reliable and long-term valid digital citation practices for the academic community. This part of the project is being realized in collaboration with the German National Library. The procedures are based on Uniform Resource Names (URNs), which perform as Persistent Identifiers. The application of URNs for periodicals is an extended version of the harvesting routines of URN Granular, which have already been implemented successfully for monographs.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Allemagne</li>
</country>
<region>
<li>District de Cologne</li>
<li>Rhénanie-du-Nord-Westphalie</li>
</region>
<settlement>
<li>Aix-la-Chapelle</li>
</settlement>
</list>
<tree>
<country name="Allemagne">
<noRegion>
<name sortKey="Sommer, Dorothea" sort="Sommer, Dorothea" uniqKey="Sommer D" first="Dorothea" last="Sommer">Dorothea Sommer</name>
</noRegion>
<name sortKey="Heiligenhaus, Kay" sort="Heiligenhaus, Kay" uniqKey="Heiligenhaus K" first="Kay" last="Heiligenhaus">Kay Heiligenhaus</name>
<name sortKey="Pankratz, Manfred" sort="Pankratz, Manfred" uniqKey="Pankratz M" first="Manfred" last="Pankratz">Manfred Pankratz</name>
<name sortKey="Wippermann, Carola" sort="Wippermann, Carola" uniqKey="Wippermann C" first="Carola" last="Wippermann">Carola Wippermann</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000140 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000140 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:14-0279350
   |texte=   Zeitungsdigitalisierung: eine neue Herausforderung für die ULB Halle: Werkstattbericht aus der Pilotphase des DFG-Projekts ?Digitalisierung historischer Zeitungen"
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024